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Abstract 



This paper introduces and studies the basic properties of Clifford algebra 
valued conditional measures. 



1 Introduction 



Probability theory was given a firm mathematical foundation in 1933, when 
Kolmogorov [[!(]] introduced his axioms. By defining probability as an un- 
interpreted special case of a positive measure with total unit mass (plus 
an additional definition for independence), the subject exploded with new 
results and found innumerable applications. In 1946, Cox (see showed 
that the Kolmogorov axioms for probability are really theorems that follow 
from basic desiderata about the representation of partial truth with real 
numbers. We owe to Ed Jaynes (see ||) the discovery of the importance 
of Cox's 1946 work (@). After Jaynes, it became clear why the calculus 
of probability is so successful in the real world. Probability works because 
its axioms axiomatize the right thing: partial truth of a logical proposition 
given another. Even more, the rules of probability are unique in the sense 
that any other set of consistent rules can be brought into the standard sum 
and product rules by a change of scale (or we may say logical gauge). This 
is in fact Cox's main result and it makes futile the enterprise of looking for 
alternatives to the calculus of normalized real valued probabilities. It is only 
by allowing the partial truth of a proposition to be encoded by an object 
other than a real number in the interval [0, 1] that we could find alternatives 
to the standard theory of probability. 

We seek to find out what happens when standard probability theory is 
modified by relaxing the axiom that the probability of an event must be a 
real number in the interval [0, 1]. We show that, by allowing the measure 
of a proposition to take a value in a Clifford Algebra, we automatically find 
the methods of standard quantum theory without ever introducing anything 
specifically related to nature itself. 

The main motivation for this article has come from realizing that the 
derivations in Cox Q still apply if real numbers are replaced by complex 
numbers as the encoders of partial truth. This was first mentioned by 
Youssef [12] and checked in more detail by Caticha [§] who also showed 
that non-relativistic Quantum theory, as formulated by Feynman Q, is the 
only consistent calculus of probability amplitudes. By measuring propo- 
sitions with Clifford numbers we automatically include the reals, complex, 
quaternions, spinors and any combination of them (among others) as special 
cases. 
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2 The Axioms 



In this section we introduce the notation and collect the simple properties 
about Boolean and Clifford algebras that will be needed for the definition 
of ijj below. 

2.1 The Boolean Algebra A 

Let A be a boolean cx-algebra of propositions a,b,c, We denote by the 

false proposition, by 1 the true proposition, by a + b the logical sum, by ab 
the logical product and by a the negation. Each proposition b £ A defines 
the set A b where, 

A b = {ba : a £ A } = bA (1) 

Clearly, A b is a subset of A that contains b and 0, it is closed for sums and 
products and thus, Ab is a sub algebra of A with b as the unit. From the 
fact that a = ac + ac it follows that 

A = A c ® Ac (2) 

Given two propositions a,b £ A we have, 

A D A a D A ab (3) 

The set X C A is called the set of elementary propositions of A (and we 
say that A is a <r-algebra of propositions in X) if, 

1. for x, y G X, xy = whenever x / y 

2. every a G A is the sum of propositions in X. We write 

If „4 and ,6 are two Boolean u-algebras of propositions in X and Y re- 
spectively, then A x B is a Boolean u-algebra of propositions in 1 x 7 if 
one defines the truth value of (a, b) £ A x6 as the truth value of ab i.e. 
true only when both a £ A and b £ B are true. We denote by A n the 
cr-algebra of n copies of A of propositions in X n . We have, 

P ^ p= Y /Xl x 2 ...x n (5) 
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and by this we mean that P is always the sum of propositions in X n and 
each x £ X n is the conjunction of n propositions, one for each copy of X. 
Finally we let A * = A \ {0}. 

Notice that these Boolean cr-algebras are nothing but the standard sets 
where general measures (in particular probability measures) are defined. We 
chose the notation of logical sums and products instead of the traditional 
set notation of unions and intersections to emphasize the fact that we are 
interested in the encoding of partial truth of logical propositions, but this is 
only a choice of notation and there is a complete one to one correspondence 
between the two languages. As general references see e.g. Halmos [Q or 
Chow and Teicher ||. 

2.2 The algebra of Clifford numbers Q 

Let Q be an arbitrary finite dimensional Clifford Algebra with real scalars. 
We try to follow the notation in Pj . We denote the elements of Q by capital 
letters like, A,B,C, .... A general Clifford number M always expands as 
the sum of its scalar, vector, bivector, etc parts like: 

M = < M > + < M >i + < M > 2 + . . . (6) 
= a + u + B + ... 

Where < M denotes the k- vector part of M. If u and v are vectors 
in Q then their geometric (Clifford) product uv can be decomposed into a 
symmetric part u ■ v and an antisymmetric part u A v as 

uv = — {uv + to) + - {uv — vu) (7) 
= u ■ v + u Av (8) 

The inner product between two vectors is always a scalar and their wedge 
product is always a bivector. The operation of reversion of a Clifford number 
M is denoted by and defined as a linear operation with the properties, 

a t = Qj u \ = U) (MiV)* = M^N^ (9) 

where a is a scalar, u is a vector, and M and are arbitrary Clifford 
numbers. The euclidean inner product on Q is given by, 

< M, N >g =< M ] N > (10) 
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2.3 Definition of ip 

By a Clifford algebra valued conditional measure (or simply a ip) we mean a 
function, 

ip : a xA* i — ► g (n 

(a,c) i — > ip(a,c) 
satisfying the following two axioms: 

(I) If c => b then 

ij){a,c) =4>{ab,c) (12) 

(II) If {ai, a2, . . .} C .4 , with a^afc = for j 7^ fc, then 

V> fx] a i> c l = X]^( a i' c ) ( 13 ) 

Since the only property a proposition in .A always has is its truth value, 
we can interpret ip(a,c) as the Clifford number that represents the truth in 
a when c is certain. Axiom (I) says that c is certain (e.g. take a = 1 and 
b = c) and axiom (II) says that the whole truth of a for a given c is always 
the sum of the truths of its separate parts. 

2.3.1 The truth of 

By taking each cij = in ( |l~3"l) we get, 

V!(0,c) =V(0,c)+#),c) + ... (14) 

and therefore, ip(0, c) is either £ Q or unbounded but if it is unbounded 
then all the propositions will be assigned an unbounded value since ip(a, c) = 
tp(a + 0, c) = ip(a, c) + ip(0, c). Hence, 



ip(0,c) =0 for all c G A * (15) 



3 The spaces H c 



The functions ip, as defined by (12) and (13), are specified independently 
at each c £ A*. So far, there is no link between the tp in the domain of 
discourse of c, i.e. ip(-,c) and ip in the more specialized domain of discourse 
of be, i.e. ip(-, be). We shall talk about changing domains of discourse in the 
next section but in this section we describe the important properties that 
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the functions ip(-,c) have as functions of their first argument only, for fixed 
c £ A *. To simplify the notation simply write vp(a) instead of ip(a,c) in 
the formulas below. Thus, whenever the background proposition c is not 
subject to change we take ip as any cr-additive function defined on A c with 
values in Q . The condition (^) is automatically satisfied since c is the true 
proposition in A c . 

Let H c be the set of all cr-additive functions defined on A c with values 
in Q . 

3.1 The H c are Hilbert spaces 

Since the sum of two cr-additive functions and the product of a cr-additive 
function by a scalar are still cr-additive functions we have that the H c are 
vector spaces. The scalars are the scalars in Q . In principle the field of 
scalars could be taken as the reals or the complex numbers but it seems 
that the reals is all that is needed in most applications. 

3.1.1 The inner product in H c 

For, (p, if} £ H c define the real inner product between them by: 

<ip,i>> = < <p( x )M*) >g (16) 

= £ < VWVW >o (17) 

By considering only ips with finite norm we make H c a real Hilbert space. 
From now on we assume the finite norm to be part of the definition of H c 
itself, i.e. 

H c = {ip : -0 is er — additive on A c and E < ^ (x)ip(x) >o < oo} (18) 

Notice that the spaces H c are complete for the inner product ([17]) since 
Q with the scalar product < ., . >g is complete. When X is a finite set (i.e. 
when it contains only a finite number of propositions) the proof is trivial, 
just use the fact that if {0 n }neiN C H c is a Cauchy sequence then for each 
x £ X the sequence {4> n (x)} n £¥i is also a Cauchy sequence of elements of Q 
and thus it converges to some <f>(x) £ Q and therefore € H c is the limit of 
the original sequence in H c . When X is infinite we need to reinterpret the 
sums as integrals, (for which we need a measure in X), and we also need to 
reinterpret the as A -measurable densities, but after that, the proof is 
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essentially the standard proof that L 2 is complete. An important example 
of an infinite X occurs when the propositions in X are labeled with the 
vectors in Q . In this case the sum in ( p"7| ) is replaced by the integral with 
respect to the standard Lebesgue measure in X. 

3.2 The isomorphic spaces: H(A c ) ~ H C (A ) 

In order to be able to understand the differences between the current ap- 
proach and ordinary probability theory it is convenient to introduce two 
other spaces closely related to H c . These are, the space of all cr-additive 
functions on A c with values on Q , 

H(A c ) = {tpc-A c — > Q \ip c a-additive on A c } (19) 
and the space, 

H C (A ) = {ip:A — >Q \ip <r-additive on A AND if c b, ip(ab) = ip(a)} 

(20) 

Both are Hilbert with the inner product ([l7]) and considering only elements 
of finite norm. 

Notice that if ip G H C (A ) then its restriction to A c belongs to H(A c ), 

i.e. 

and conversely, if if) c G H(A c ) then the function ip defined by: 

tp(a) = V'c(oc) Va G A 

belongs to H C (A ) since it is clearly u-additive and if c =>■ b then c + b = 1 
and multiplying both sides by c we get, be = c from where, 

ip(ab) = ip c (abc) = V'c(oc) = ^{a) 

The map ip — * ip c is obviously linear one to one and onto so it makes the 
two spaces isomorphic. 

Consider now two propositions b and c such that c, be G A*. Then we 
can write: 

H{A bc ) ~ H(bA c ) ~ H b (A c ) (21) 

In other words each bc) G H {A be) uniquely defines a function <p(-b, c) G 
Hb{A c ) and that is all we can say. Since the ips are unnormalized we 
can not write a general product rule as in normalized standard probability 
theory. Nevertheless, it is possible to justify a restricted product rule for 
independence as we do in section 5 below. When c = 1 G A we simply 
write H{A ) instead of H X {A ) or H(A i). 
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4 The truth with if) 



The remarkable fact about the functions ip is that without committing to 
a particular value for ip(l) in Q , they still allow to tell what propositions 
are true. We show in this section that, b £ A is considered to be true by 
ip when ip(ba) = Va £ A . By liberating ordinary probabilities from the 
constrain that the probability of the whole space must always be fixed at 
one, we make the space of all possible assignments of partial truth into a 
Hilbert space without losing the ability to identify truth. 

4.1 Propositions as operators 

Each proposition b £ A defines two complementary linear operators on 
H c by multiplication, b, and by addition, b to the first argument of ip. In 
symbols 

bip(a,c) = ip(ab, c) Va e A (22) 
bip(a,c) = ip(a + b,c)Va£ A (23) 

To simplify the notation we often omit the hats and simply write bip instead 
of bip. From bb = b and b + b = b it follows that b and b are projectors and 
therefore they are self-adjoint with eigen values either or 1. We can write, 

Theorem 1 The following two complementary statements are true. 

1. If ip £ H c is an eigen vector of the operator b with eigen value 1 
then ip(b) = ip(l) and we say that tp makes b true conditional on c. 
Conversely, if c =^ b then every ip £ H c is an eigen vector of the 
operator b with eigen value 1. 

2. If tp £ H c is an eigen vector of the operator b with eigen value 1 then 
ip(b) = and we say that ip makes b false conditional on c . Conversely, 
if c b then every tp £ H c is an eigen vector of the operator b with 
eigen value 1. 

Proof 

1. 

bip = ip ip(ab, c) = tp(a, c) Va £ A =^ ip(b, c) = ip(l, c) 
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where the last implication follows by taking a = 1. Conversely, if c =>■ b 
then from (O) we have, 



ip(ab, c) = ip(a, c) Va G A 

Thus, bip = ip. 

2. 

bip = ip ip(a + b,c) = ip(a, c) Va € A =>- ip(b, c) = ip(0, c) 
where the last implication follows by taking a = 0. Conversely, 
(c^b)^c + b = l^a + bc = a\/aeA 



Now from the fact that ip is a function and applying (12) twice we 
have, 

V>(a + 6c, c) = V( a > c) Va G .A 
ip(ac + bc,c) = tp(a,c) Va G A 
ip(a + b,c) = ip(a,c) Va £ A 



Thus, bip = tp • 
The following theorem elaborates on the same theme. 

Theorem 2 Let b £ A be an arbitrary proposition in a a -algebra of propo- 
sitions in X and let ip £ H(A ). The following are all equivalent: 

1. bip = ip i.e., ip makes b true. 

2. bip = i.e., ip makes b false. 

3. \\hl>\\ = 

4- 



Proof: 

We show that 1 44> 2 44> 3 4^ 4. First equivalence follows from ip = bip + bip, 
the second equivalence is a property of the norm and the third equivalence 
is Pythagoras theorem since (bip) _L (bip) • 
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It is evident from this last theorem that the norm in the Hilbert spaces 
H c provides a mechanism for translating the Clifford numbers ip(b) assigned 
to the propositions in A by a function ip G H (A ) into positive real numbers 



HMI 2 = ||(1-&)^|| 2 = ||V>-M| 2 (24) 
= W\\ 2 -\Wf (25) 

measuring how close is ifi from making the proposition b true. It is also clear 
from ( p4[ ) and ( |2lf ) that it is the square of the norm and not just the norm 
what is needed. It is only with the square of the norms that we can say that 
the amount of truth of b (measured by ||6^|| 2 ) equals the amount of truth 
assigned to the true proposition (measured by || 1-0|| 2 ) minus the amount of 
truth assigned to b (measured by ||fr0|| 2 ). 

4.2 Commutativity, orthogonality and a Clifford number times 
a proposition 

Propositional operators can be composed to form other operators. Thus, if 
o, b G A and tp G H(A ) we have, 

(ab)ifj(x) = a(b\l>(x)) = ip(abx) = (ab)ip(x) (26) 
(ab)ip(x) = atp(x + b) = i/j(ax + ab) (27) 
(ba)ifj(x) = b^(ax) = ifj(ax + b) (28) 

and we can see that checks commute with other checks and hats commute 
with other hats, but in general, hats don't commute with checks. 
If A G Q and b G A we can define the operator Ab by, 

(Ab)ip(x) = A(bip{x)) = A^j{bx) (29) 

and similarly for Ab. These definitions allow a very rich algebra of oper- 
ators that mix boolean and Clifford algebra properties in new ways. One 
particularly interesting example of this kind of mix is given by the following 
statement: mutually exclusive propositions are orthogonal. More explicitly, 
if o, b G A and ipi,ip2 G H(A ) then, ab = =^< aipi,bip2 >= and 
pythagoras theorem holds, 

\\aijji + &V2II 2 = IHill 2 + IN2II 2 (30) 
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5 Independence 



If the Clifford number ip(a, c) is interpreted as a representation of the partial 
truth of a when we assume c to be certain then there is only one rational 
way to define independence namely: 

Preliminary Definition: We say that ip makes propositions a and b in 
A logically independent conditionally on c G A * if the additional knowledge 
of one of them does not affect the value of i/j for the other, i.e, 

ijj(a,bc) = ijj(a,c) 

AND (31) 
tp(b,ac) = ip(b,c) 

whenever the conditional ips exist. 
5.1 A restricted product rule 

If we try to find the value of ip{ab, c) in terms of the partial truths that ip 
assigns to a and b, then the most general relation is, 

tp(ab, c) = F(ip(a, c),ip(a, bc),tp(b, c), tp(b, ac)) (32) 

where F is an arbitrary function of its arguments. If we assume further that 
a and b are logically independent conditionally on c then using ( pl| ) the most 
general relation becomes, 

rP(ab,c) =F^(a,c)Mb,c)) (33) 

Let u = ip(a,c),v = ip(b,c) and w = ip(d,c) and use the commutativity 
and associativity properties of the logical product to get the following two 
properties for the function F: 

F(u,v) = F(v,u) (34) 
F{F(u,v),w) = F(u,F(v,w)) (35) 

In other words, F must be symmetric and it must satisfy the usual associativ- 
ity equation. If the take values only on a commutative subspace of Q (e.g. 
reals, complex or pseudo scalars) then the only solution is F(u, v) = uv (see 
[0) but this can not be the solution if uv ^ vu. Given that F(u, v) must be 
symmetric, and that it must reduce to uv when u and v commute, and obvi- 
ous solution is given by the symmetrization of the product, i.e. [uv + vu)/2. 
In principle, it seems feasible that a modification of the standard argument 



10 



of Aczel (see[[T| or JTl|]) may yield the symmetrized product as the unique 
solution of ( pq) and ( p4[) for u,v G Q at least for u, v in some subset of 
for which ( |35|) is still true. At the present time there is no such proof. In 
any case the lack of a proof for the uniqueness is not a deterrent to turn the 
formula into the definition for independence. If it turns out that there are 
multiple solutions (which seems highly unlikely) the results obtained from 
this particular solution will still be valid. Thus, from now on we say that ip 
makes a and b (logically) independent given c if, 

^(ab, c) = - [ip(a, c)ip(b, c) + tp(b, c)ip(a, c)] (36) 
More generally we have, 

Definition: We say that tp makes a\, 0,2, . . . , a n logically independent given 
c if, for k = 1, 2, . . . , n and 1 < i\ < 12 < ■ ■ ■ < ik < n 

k 1 

^(Il a V' c ) = ^j$I^( a ^(n)' c )^( a ^fe)' c )---V ; (a ( 7(j fc ),c) (37) 

j = l ■ a 

where the sum runs over all the permutations, a of (ii, 12, ■ ■ ■ ,ik)- 

The associativity equation (35) imposes a heavy restriction on the pos- 
sible ip assignments for independent propositions. In fact we have, 

Theorem 3 If ip makes three or more propositions a,b,d, . . . independent 
conditionally on c then the Clifford numbers u = tp(a,c),v = ip(b,c),w = 
^(d,c), . . . are such that each of them commutes with the anticommutator of 
any other two. 

Proof: 

From (|37| ) it suffices to show that v must commute with F(u, w) = (uw + 
wu)/2 when a,b,d are independent given c. 
The right hand side of (35) simplifies to, 

F(u, F(v, w)) = — {uvw — uwv + vwu — wvu} (38) 

Similarly the left hand side of ( |35[ ) is given by, 

F(F(u, v),w) = — {uvw — vuw + wuv — wvu} (39) 



Equating (|38|) to (|39|) and simplifying we get, 

[v,F(u,w)]=0 (40) 
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where [u, v] = uv — vu denotes the usual commutator product • 

Notice that when the Clifford numbers u, v, w, . . . either commute or an- 
ticommute with each other then ( |40| ) is true. But there are many other 
solutions. For example ( fIo|) is also true when u, v, w, . . . are arbitrary vec- 
tors. 

5.2 Independence and Orthogonality 

The above definition for independence makes the following statement true: 

Theorem 4 If ip(ab,c) = then ip(a,c) anticommutes with ip(b,c) when 
and only when t/j makes a and b independent given c. 

There is nothing like this in standard probability theory where mutually 
exclusive events that are possible (i.e. that have positive probability) are 
never independent. We are used to think that this makes sense, for if we 
know that one of the events happens then we also know that the other 
couldn't happen. The events are totally linked so they can't be independent. 

This is fine for real numbers, that are commutative, but not with Clif- 
ford numbers. There is however, a extreme case where the above theorem 
is true even in standard probability theory. Suppose that a, b and c are 
three mutually exclusive propositions. Then, tp(ab,c) = ip(a,c) = ip(b,c) = 
ip(a,c)ifi(b,c) = and we would have to say that a and b are independent 
given c even though neither a nor b are possible given c. Anticommutativity 
allows this to happen even when t/j(a, c) and ift(b, c) are not zero. Two events 
can be completely linked (i.e. mutually exclusive) and at the same time be 
logically independent from each other! This is as weird as entanglement in 
quantum mechanics. 

6 Flipping n coins 

If A is a o"-algebra of propositions in X then, by the a-additivity property, 
every ij) 6 H(A ) is completely specified on A by just giving ijj(x) for all 
x G X, i.e., 

iP(x) = £ ^y)6 v (x) (41) 

y&X 

where for x, y G X, 5 y (x) is 1 G Q if x = y and G Q otherwise. 
We consider the following special case. 
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6.1 The Binomial experiment with ips 

Let a be an arbitrary proposition and let X = {a, a} and A = {1,0, a, a}. 
Clearly A is a boolean algebra of propositions in X. From ( pip we have 

iP(x) = A5 a (x) + B5- a (x) (42) 

where ifi £ H(A ) and A,B £ Q . This is the canonical Bernoulli experiment. 
There are only two possible outcomes a and a with partial truths encoded 
by the Clifford numbers A = ip(a) and B = ip(a). As in standard probability 
theory, consider now n independent repetitions of the Bernoulli experiment, 
i.e., consider X n with its corresponding boolean algebra A n of elements in 
X n (see (||)). From ( fj~l"| ) a general tp n € H(A n ) is given by, 

^ n {x) = J2 MvMx) (43) 
yex n 

From the assumption that ijj n make the different repetitions independent, 
we obtain, using (^) that 

rp n (x) = ip„(xi, ...,x n ) = M n (m(x)) (44) 

where for each integer k with < k < n, M n (k) £ Q is the symmetrization 
of the product A k B n ~ k and m(x) is the number of a's in x. Here are some 
examples for n = 2 and n = 3, 

M^,a) = A 2 = M 2 (2), Ma,a) = Ma,a) = ^[AB + BA]=M 2 (l) 

xp 3 (a,a,a) = ip 3 (a,a,a) = ip 3 (a,a,a) = -[A 2 B + ABA + BA 2 } = M 3 (2). 

3 

Now define the proposition P k G A n by, 

P k = "exactly k of the n repetitions is an a" (45) 
Recall that by (^2|) we have, 

= = { o" W < 46 » 

By the first part of theorem (1) we have that ip n makes P k true when 
Pn^n = ipn- So the question is: How far is ip n from making P k true?. 
Answer: \\ip n - P^n|| 2 - 



13 



6.2 Computation of \\i}j n — P^-0 n || 2 

To compute this distance we use the fact that P*, and its negation in A n , 
1 — P*, are mutually exclusive propositions hence orthogonal (see (|30D) and 
by pythagoras, 

llVn-P^nll^ll^nf-HP^nll 2 (47) 

Let us compute each of these terms. From (|Tt|) , 



IIV>n|| 2 = E (^WVn(x))0. (48) 
x£X n 

and using ( f43| ) and (44) we can write, 

i> n (x) = E M n (m(y))^(x) (49) 

from where we obtain, 

4>l{x)^n(x) = E M l( rn (yi) M n(m(y2))Sy 1 (x)Sy 2 (x) 

= J2 Ml{m{x))M n (m{x)) 
xex n 



and replacing in (^) we get, 

ll„/. I|2 



E |M n (m(x))p 



' n 



n 



E(-)l^(j)| 2 (50) 



the last equation followed from the fact that there are Q) propositions in 
X n with exactly j components equal to a. We use the same fact again to 
compute the other norm in (f47|), 



HP^nll 2 = E <Vi(P„M<MP n M>0 (51) 

xex n 

to obtain, 

\\Pfon\\ 2 = {pj\M n (k)\ 2 (52) 
Replacing d50) and (p^) in (ETj) we get, 



^n|| 2 = E (J) |M n (j)| 2 - r ) |M n (fc)| 2 (53) 



i=o 
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Let us consider the proposition, Pj[ e 6i" denned by, 

P£ e = "The observed frequency of a's in n independent repetitions is 
k/n with / - e < J < / + e" (54) 

in other words for x G X n , P£ € x 7^ when and only when the proportion 
of a's in x = (x\, . . . ,x n ) is within e from the specified frequency /. The 
proposition P£ t is equal to the following disjunction of 2ne + 1 mutually 
exclusive propositions P%: 

PL= E P n (55) 

k=n(f~e) 



hence, from (pOj) we get 



n(/+e) 

ll^n|| 2 = E H^nll 2 (56) 
fc=n(/-e) 



and from (47) and (|53| ) we can write, 

ll^n-Pt^l| 2 =E l^(i)| 2 - E U I^WI 2 (57) 

j=0 W k=n(f-e) W 

In general this distance increases without limit as n — > oo but it can converge 
relative to the size of ijj n . Let us define the relative error by, 

■ f - u f (58) 

Using ([5?]) and ( |50[) we have, 

E rW)i 2 

AL = 1 - fc =T/ s (59) 



e ; iAf»(*)i s 



fc=0 



We separate the computation of e into three different cases. 
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6.3 Case: AB = BA 

From (|aS| ) we can write the following, 

Theorem 5 If AB / 0, AB = BA and \ A k B n ~ k \ = \ A\ k \B\ n - k then, 



1 



n(f+e) 

E 

fc=n(/-e) 



? ^(i-p)"-* 



where, 



P 



\A\< 



\A\ 2 + \B\ 2 



(60) 



(61) 



Proof: 

Under the conditions of the theorem we have, 

\M n {k)\ 2 = |A| 2fc |£| 2 ( n -*) 
replacing this last equation in ( |59|) and noticing that, 

E (fj \A\ 2k \B\ 2 ^ = (\A\ 2 + \B\ 2 ) n 

we immediately obtain ( |60| ) and (|6l]) • 

It is not always true that for A,B £ Q , \ AB\ = |^4||i?| even when AB = 
BA (take for example A = 1 + ait, B = 1 — for a unit vector it and 
scalars a and /?) so the extra condition besides commutativity is needed for 
the theorem to be true. 



6.4 Case: AB = 

Unlike the real (or complex) numbers, the product of non zero Clifford num- 
bers can be zero (e.g. take a = (5 = 1 in the example above) so this case is 
not trivial. When AB = the following is true, 



Theorem 6 If AB = G then, 



\A\ 2n 
\A\ 2n + \B\ 2r ' 



1 



\B 



2n 



I \A^ + \B 



2n 



iff = 
if0<f<l 

iff = l 



(62) 
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Proof: 

Notice that when AB = then all the symmetrized products, except the 
two extremes are zero, i.e., M n {k) = for all < k < n and M n {n) = A n 
and M n (0) = B n . Substituting these values into (|59| ) we obtain (|62]) • 



6.5 Case: AB = -BA 

When A and B anticommute we have, 

Theorem 7 If AB / 0, AB = -BA and \A k B n ~ k \ = \ A\ k \B\ n ~ k then, 

£ b n (k)(l - 2\ n (k)f 

At k=n(f—e) , „ 

f n,e = 1 ^ (63) 

X>n(A;)(l-2A n (fc)) 2 

fc=0 

where b n {k) are the binomial probabilities, 

b n (k) = f k jp k (l -p) n ~ k , withp as before. i.e.,p= ± ^ (64) 

and the numbers X n (k) satisfy X n (k) = A n (n — k) and for k < n/2, X n (k) is 
the chance of drawing and odd number of RED balls out of k draws without 
replacement from a box containing either: n/2 REDS and n/2 BLUES if n 
is even or (n + l)/2 REDS and {n — l)/2 BLUES if n is odd. 

Proof: 

Recall that M n (k) is the symmetrization of A k B n ~ k , i.e., the average over 
all the permutations of A k B n ~ k . There are (^) permutations and, by the 
assumed anticommutativity of A with B, each permutation is either A k B n ~ k 
or —A k B n ~ k so we have, 

M„ W = (65) 

where p(n,k) is an integer. From the fact that |M n (/c)| is invariant under 
the transformation: A — > B, B — > A, and k — ► (n — fc) it follows that 
|p(n, = |p(n, n — In order to prove the theorem it is sufficient to 
show that, 

^^ = |1-2A^)| (66) 
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since if (|66|) is true, by using the conditions of the theorem we have, 

\M n {k)\ 2 = (1 - 2X n (k)) 2 \A\ 2k \B\ 2( - n -^ (67) 

and dividing the numerator and the denominator of (|5^) by (\A\ 2 + |S| 2 ) n 
we obtain (|63[). 

Let us show that (|66|) is true by giving an explicit formula for \p(n, k)\ 
when k < n/2. To do this, represent each permutation of A k B n ~ k by the k 
integers (jij'2 • • • jk) that correspond to the positions of the j4's in increasing 
order. For example, for n = 6 and k = 3, the permutation ABABBA is 
represented by (136), since the ^4s are found at positions 1,3 and 6. The 
permutation AABBBA is represented by (126) etc. Define the parity of 
(ii ■■■3k) as 

parity of (hj 2 . ..j k ) = = (_i)ii(_i)*» . . . (_i)i* (68) 

Note that the transposition of an A with a B, located next to it, changes 
by one the position of that A in the permutation and hence, the parity of 
the permutation obtained after the transposition is always the reverse of 
the parity of the original permutation. From this and the fact that we can 
transform any permutation into any other by a sequence of transpositions 
it follows that two permutations have the same parity if and only if the 
number of flips (transpositions) necessary for transforming one permutation 
into the other is even. 

The permutation A k B n ~ k always corresponds to (12 ... A;) and therefore 
an arbitrary permutation {j\j2 ■ ■ ■ jk) wm have the same parity as A k B n ~ k if 
the parity of the number of odd integers in the set {ji,j2, ■ ■ ■ ,jk} is the same 
as the parity of the number of odd integers in the set {1, 2, . . . , k}. In other 
words, if there are an even number of odd integers in the set {1, 2, . . . , k} 
then every permutation (j\j2 ■ ■ - jk) which also contains an even number of 
odd integers can be reorder into A k B n ~ k but if the number of odd integers in 
{jii • • • ) 3k} is odd then the permutation reorders into — A k B n ~ k . Therefore, 
we can write 

|p( nj fc)| = | (69) 
1<Ji<J2— <jk<n 

Thus, if we call N e the number of permutations with an even number of odd 
integers among {j\ . . . , j^} and we call N a the number of permutations with 
an odd number of odds, then, 

|p(n,A;)| = |JV e -JV | (70) 
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using the fact that N e + N Q 



we also have that, 



\p(n,k)\ 



2N \ 



(71) 



We now turn to the computation of N a . Let N (m) be the total number of 
permutations (ju'2 • • • jk) with exactly m of the positions of the A's being 
odd. We have, 



N 



r-l 



^iV (2i + l) if k is even 



t=o 



2 



(72) 



J2 N o(2t + 1) if k is odd 



where, for < m < k < n/2 
N (m) = 



n/2\ 1 n/2 \ 
m 1 \k—ml 



/(n+l)/2w(n-l)/2\ 



if n is even 
if n is odd 



(73) 



this is because the set {1,2, . . . ,n} contains an equal number of odd and 
even numbers when n is even but the number of odds is one more than 
the number of even when n is odd. So dividing ( |7l[) by and using ( [72] ) 
and (]73| ) we obtain (|66| ) with X n (k) defined as the theorem says. There are 
four different formulas for X n (k) depending on the parities of n and A;. Let 
us check one of them. When n and k are both even and k < n/2 we have, 



X n (k) 




(74) 



and we can see that (74) is the chance of drawing an odd number of red 
balls when drawing at random k balls, without replacement, from a box 
containing n/2 red balls and n/2 blue balls. This completes the proof of the 
theorem • 



7 The weak law of large numbers 
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7.1 Taking limits as n — > oo 

In this section we compute 

lim A{, 

for the three cases considered in the previous section. 

Theorem 8 If AB ^ 0, |A fe £ n - fc | = |A| fc |5| n - fc and erf/ier AB = BA or 
AB = —BA then, for all sufficiently small e > 0, 

lim AL = { ? (75) 

n-»oo II if f p y ' 

\A\ 2 

where as before, p = rfjkjjjjp ■ Moreover, if AB = 0, i/ien Ve > 0, 

if (f = and \A\ < \B\) or (f = 1 and |A| > 
1/2 if |A| = |5| and either f = or f = 1 

1 otherwise 



lim A£, 

ra— >oo 



(76) 

Proof 

For the first part we use equations, ( |60[ ) and (|6^). By the usual gaussian 
approximation for the binomial probabilities (e.g. see [?] p. 59) we have that 
for any integers < k\ < k<i < n and any function g n with finite expectation 
with respect to the standard gaussian, 

k2 k2-np 2 

b n (k)g n (k) = g n (np + Xy/npq)—^=e^- dx (l + o(n°)j (77) 

thus, taking k\ = n(f — e), k% = n{f + e), g n {y) = 1 for < y < n and 
9n(y) = outside [0,n] we obtain from equation (p0) that, 



n{f-p + e) _ 2 

lim =1- lim / ^=e^dx (78) 

n— >oo "' e n— >oo y n(/-p-«Q ,/27r 



hence, when f ^ p for any < e < |/ — p| the limits of the integral in 
equation ((7^) are both positive or both negative and both going to oo as 
n — * oo so the desired limit is 1 — = 0. On the other hand when f = p for 
any e > the desired limit is 1 — 1 = and this shows that ( |75| ) is true for 
the commutative case. To show (|7q) for the anticommutative case we take 



9n{y) = \y 2 {y-if (79) 
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which increases like y 6 and therefore it has finite expectation with respect 
to the standard gaussian. If we show that 

g n (k) = n 2 (l-2X n (k)) 2 + o(n°) (80) 



(81) 



then it will follow from (gOJ), ([77]) and (||) that, 

"(/-P+<0 2 

lim A£ = 1 — lim 

J - np g n {np + x^/Wpq)^=e 2 <ix 



and by the same reasoning as in the commutative case we obtain ( |75|) for 
the anticommutative case. Let us then show (|80|). Notice that from ( |74| ) we 
can write, 

2 x 

A„(fc) = ^ W(n, 2t + 1, A;) (82) 



t=o 



where the hypergeometric probabilities, 
fn/2\ ( n/2 \ 



W(n, m, k) 



in 



k — m , 



n 
k, 



~±(n/2) 






1 


( n / 2 w 








n k m \k — ml 




-ifnY 
n k \ k J 





(83) 



11/1 1 \ / 1 m—l \ 
ml 2 V 2 n / V 2 n / 




1 1 / 1 1\ (1 k—m—l\ 
(fe-m)!2V2 n/ V2 n J 




2)... (1 -*=!)" 





expanding the products up to terms of order (1/n) and letting W = W(n, m, k) 
we have, 

2- m {l - + o(n- 1 )}] \2 m ~ k {\ - (fc - m)(fc - m ~ 1) + oin- 1 )} 



W 



k x 



1 



1 [m 2 + (k — mf -k]+ o{n~ l ) \\ 1 + 



n 



fc(fc-l) 



n 



(m) Q) fc {l + ^"-)+^- 1 )} 



+ o(n~ 1 ) 
(84) 



21 



We can readily check that, 



E 

t=0 



' k 

,2i + l 



(85) 



and that, 



5^(2t + l)(fc-2t-l) 
t=o 



' k ' 
2t+l. 



- 1)2 



k-l 



(86) 



From (H), (H), (H) and (H) we have 



(87) 



Squaring both sides of (|87 ) and multiplying through by 4n we obtain, 

1 



n 2 (l - 2A n (/t)) 2 = -k 2 (k - l) 2 + o(n°) 

which is exactly (^0|). This ends the proof for the anticommutative case. 
The second part of the theorem i.e. (76) follows directly from (|62^) by taking 
limits as n — > oo» 



7.2 Flipping an infinite number of coins 

As in standard probability theory there is a subtle nuisance with limits 



such as flTq ) and (76) that needs to be faced in order to have a straight 



probabilistic interpretation for laws of large numbers. The problem with 



( |75|) and (76) is that it is not clear how to paste all the ifj n together into 
one global V'oo- It was due to these kind of problems that modern measure- 
theoretic probability theory was born. 

To be able to make statements about infinite sequences of bernoulli trials 
we need to specify a boolean er-algebra, A 00 , that contains at least those 
statements. This can be done as in standard probability theory (e.g. see 
|J), i.e. A 00 is defined as the smallest <T-algebra containing the cylinder 
sets, in particular it contains the propositions P% defined in (|45|) but now 
n refers to the first n repetitions in an infinite sequence of bernoulli tri- 
als. Having constructed A °° we also need to construct the Hilbert space, 
H(A °°), containing the functions ip = V\x>- Again, the construction is not 
trivial but well known in functional analysis as the standard construction of 
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an infinite tensor product of Hilbert spaces (e.g. see |7|]). These standard 
constructions allow us to write, 



= P^ n (89) 



where ip = Voo 6 H(A °° ). Equation ( p9|) can be used to re- write the 
statements (f75|) and (|76|) as, 



Theorem 9 Lei X°° be the space of infinite sequences of independent tosses 
of a coin and let A 00 be the smallest a -algebra containing all the propositions 
P k about elements in X°° . If for each toss the tp values for falling heads 
and tails are the Clifford numbers A and B satisfying, 

1. \A\ 2 + \B\ 2 = 1 

2. AB^O 

3. either AB = BA or AB = -AB 

4. \A k B n ~ k \ = \A\ k \B\ n - k Vra G 1N,V0 < k < n. 
Then, for all sufficiently small e > the propositions, 

p\A\ 2 p A oo 

are true. 
Proof 

Under the conditions of the theorem we have from ( |89| ) and (75) that when 
the ip n are normalized i.e. when \\ip n \\ = 1 for all n then, 

U - Plfn - as n -» oo 

or equivalently, 

lim o PlfiP = P^i> = i> (90) 
\A\' 2 

so that tp is an eigen vector of the operator Ptx3,e with eigen value 1 and 
thus, it makes the proposition true • 
We also have, 

Theorem 10 Let X°° and A 00 be as in the previous theorem but now sup- 
pose that the Clifford numbers A and B satisfy, 

1. AB = 
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2. \A\ > \B\ 
Then for all e > the propositions, 

pi p A OO 

are true. 
Proof 

Under the conditions of the theorem we have from (|89| ) and (76) that when 
the tp n are all of unit norm then 

HV> - ^eV'll * as 71 > oo 

or equivalently, 

n limP^ = P^^ (91) 

so that ip is an eigen vector of the operator e with eigen value 1 and 
thus, it makes the proposition true • 



7.3 Interpretation and examples 

The previous two theorems can be interpreted as in standard probability 
theory. They say that an infinite sequence of independent tosses of a coin 
with ip{ heads ) = A and ip( tails ) = B will have for sure (relative to -0) 
a frequency of heads within e from \A\ 2 in the first case and within e from 
1 in the AB = case. When AB = the theorem assures us that (again 
relative to ip) the coin will show up heads with frequency 100% whenever 
\A\ > \B\ ! 

The four conditions on A and B that are needed for the AB ^ case, 
impose heavy restrictions on the possible values that A and B can take but 
there are lots of examples. Let p be a real number in the interval [0, 1] and 
consider, 

Example 1 

A = VP B = ^T^p (92) 

Example 2 

A = VP B = ^/T^pB (93) 

where B = ct\G2 . ■ .o~ r is a unit blade, i.e. it can be factorized into a 
product of orthogonal (anticommuting) unit vectors o~j . 
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Example 3 

A = v^i B = y/T^pB (94) 
where A and B are both unit blades possibly of different dimensions. 
Example 4 

A = y/p e ia A B = y/T^p e lp B (95) 

where a and (3 are scalars, A and B are both unit blades and i is any 
multivector such that i 2 = — 1 and i commutes or anticommutes with 
both A and B i.e. \A = ±Ai and iB = ±Bi 

It can be readily check that all these examples satisfy the four conditions of 
the theorem and hence, coin tosses with these ips will show up heads with 
probability p. 

7.4 Why isn't every one a frequentist? 

For the same reason as in probability theory these laws of large numbers 
can not be used to define what we mean by the partial truth that the coin 
will show up heads in the next toss since the theorem only says that the 
propositions e are made true by V- So any attempt to use the law of 
large numbers as the definition of what tp is, or means, is therefore circular. 

8 The Boolean algebra of Caticha's temporal fil- 
ters 

Let X be a set and let B be a cr-algebra of subsets of X. Notice that we are 
using the standard set notation for the elements of B instead of the logical 
notation used in the rest of the paper. The reason for changing the notation 
is that the boolean cr-algebra that we are trying to define is not B itself but 
only based on B . Think of X as the set of possible locations for a point 
particle and define the elementary propositions e(x, t) by the statement: the 
particle is at location x at time t. As in J2|, e(x,t) is a pure hypothesis not 
the result of a measurement. The truth value of e(x, t) can be obtained, at 
least in principle, by imagining a filter that covers all of X except at location 
x where it has an infinitesimal hole. This magical filter materializes only for 
an instant at time t and then disappears leaving no trace of its existence. If 
after time t we still find the particle somewhere then we conclude that e(x, t) 
is true. These filters form a boolean algebra with the definitions below. 
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Let T be a subset of the real line and define for t G T and B £ B the 
proposition e(i?, t) as: an elementary filter at time t with B open. Thus, 
e(B, t) is true if and only if the statement: the particle is somewhere in B 
at time t is true. We define the logical product of two elementary filters as 
the operation of putting one on top of the other and we define the negation 
of an elementary filter as the filter that closes the holes and opens the rest. 
In symbols: 

e{B 1 ,t)e{B 2 ,t) = e{B x ^B 2 ,t) (96) 
e(B,t) = e(B,t) (97) 
e(Bi,t) + e(B 2 ,t) = e{B 1 UB 2 ,t) (98) 

where B = X \ B is the complement of B with respect to X. Notice that 
fl98D follows from (|96|) and (|97|) by using De'Morgan's law i.e., 



e(B x ,t) + e{B 2 ,t) = e(B u t)e(B 2 ,t) 
= e(B 1 UB 2 ,t) 

We also have that for all s,t G T, 

e(B\, s)e(B 2 ,t) = "Filter at time s followed (or on top of) filter at time t" 

e(J3i, s) + e(B 2 ,t) = "Filter at time s OR filter at time t" 

e(cj), t) = "Barrier (nothing open) at time t" = (99) 

e(X,t) = "Absence of filter (all open) at time t" = 1 (100) 

We define T as the smallest c-algebra containing the elementary filters 
e(B, t) i.e., 

T = a {e(B, t) : B G B ,t G T} (101) 

The boolean algebra of temporal filters T is a spell out of the usual algebra 
of events of a stochastic process with state space X. 



8.1 The Markov Property 

Due to the fact that there is no product rule for the unnormalized i/js we 
cannot make use of the standard Markov property of probability theory di- 
rectly. The following definition is all that is needed to recover non relativistic 
quantum mechanics, 

Definition: -0 G H(J- ) is said to have independent segments given c G T 
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if for all n = 1, 2, . . ., all times t\ < t 2 < ■ ■ ■ < t n in T and all locations 
x%, x 2 , ■ ■ ■ , x n in X the propositions 

e(xi,ti)e(x 2 ,t 2 ), e(x 2 ,t 2 )e(x 3 ,t 3 ), . . . , e(z n _i, t n _i)e(x n _i, t n _i) 

are independent given c. 

8.2 Time evolution and the Schrodinger equation 

When ip £ H{J- ) has independent segments, it evolves according to the 
Schrodringer equation. The usual jargon of quantum mechanics is recovered 
with the notation, 

Probability Amplitude: ip(e(x, s)e(y, t), e(xo, to)) is the amplitude for the 
particle to go from location x at time s to location y at time t > s given 
that it was initially prepared at location xq at time to- We denote this 
amplitude by K(y, t; x, s). 

Wave Function: ip(e(x, t), e(xo, to)) is the amplitude of going from the ini- 
tial position to location x at time t. It is often denoted by just ^f(x,t). 

Thus, with this notation, a particle which is prepared by e(xo,to) and for 
which ip £ H{T ) has independent segments conditionally on this prepara- 
tion, will satisfy, 



1 



tt(z, t) = J2~ [K(x, t; y, s)*(y, s) + s)K(x, t; y, s)] 



(102) 



ydX 



since 



= '^2^(.e{x,t)e(y,s),e{x ,t )) 
vex 

= ^2 ^(Mx ,t )e(y,s)} [e(y,s)e(x,t)],e(x ,t )) 

y&X 



taking derivatives in (102) with respect to t and evaluating at t = s we 
obtain, 



<9*(x,t) 



Ot 



^ 2 



dK(x,t;y,s) 



at 



y(y,s) + t>(y,s) 



dK(x,t;y,s) 



t=s 



Of 



Defining the Hamiltonian H by, 

8K(x,t;y,s) 



dt 



t=s 



■~H(x,y,s) 



(103) 
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where % is any multivector that squares to —1 and that it commutes with all 
the tps. Relabeling s with t we can write Schrodinger equation for possible 
non-commuting ^s as, 



ih m^i = ^ I [H ( X) y} t ) + *(y, t)H(y, x, t)} (104) 



at 



y&X 



when the wave functions ^ commute with the Hamiltonian, (e.g. when all 
the ips take values in a commutative subspace of Q ) (104) reduces to the 
usual Schrodinger equation. 



9 Next: 

Using the Spacetime algebra How to connect the above with the Dirac- 
Hestenes equation. 

ij) assignments in the real continuous case Minimum Fisher informa- 
tion and the Huber-Frieden derivation of the time independent Schrodinger 
equation. 

ip and Brownian motion Nagasawa's diffusion model. 

Comments and conclusion What the hell is this all about and what it 
may be likely to become.... 
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